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Abstract 

We present a system to translate natural language sentences to formulas in a formal or a knowledge 
representation language. Our system uses two inverse A-calculus operators and using them can take as 
input the semantic representation of some words, phrases and sentences and from that derive the semantic 
representation of other words and phrases. Our inverse A operator works on many formal languages 
including first order logic, database query languages and answer set programming. Our system uses a 
syntactic combinatorial categorial parser to parse natural language sentences and also to construct the 
semantic meaning of the sentences as directed by their parsing. The same parser is used for both. In 
addition to the inverse A-calculus operators, our system uses a notion of generalization to learn semantic 
representation of words from the semantic representation of other words that are of the same category. 
Together with this, we use an existing statistical learning approach to assign weights to deal with multiple 
meanings of words. Our system produces improved results on standard corpora on natural language 
interfaces for robot command and control and database queries. 



1 Introduction 

Our long term goal is to develop general methodologies to translate natural language text into a formal knowl- 
edge representation (KR) language. In the absence of a single KR language that is appropriate for expressing 
all the nuances of a natural language, currently, depending on the need different KR languages are used. 
For example, while first-order logic is appropriate for mathematical knowledge, one of its subset Descrip- 
tion logic is considered appropriate for expressing ontologies, temporal logics are considered appropriate 
for expressing goals of agents and robots, and various non-monotonic logics have been proposed to express 
common-sense knowledge. Thus, one of of our goals in this paper is to develop general methodologies that 
can be used in translating natural language to a desired KR language. 

There have been several learning based approaches, mai n ly from two groups at MIT and Austin. These 
include the following works : 'Zettlemover and Collins' /loos'), 'Kate and MooneV ('2006') , Wons and MooneM 




Wong and MoonevI (2007), Lu et al. (2008), Zettl emover and Collins (2007) and Ge and Mooney 
Given a training corpus of natural language sentences coupled with their desired representations, 
these approaches learn a model capable of trans l ating sentences to a desired meaning representation. For 



example, in the work by ^Zettlemover an d CoUinsI ( 2005 ^). a set of hand crafted rules is used to learn syntactic 



categories and semantic representations of words based on combinatorial categorial grammar (CCG), as 
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described bvlSteedman (l2000h . and A-calculus formulas, as discussed by iGamutI (Il99lh . The later work of 
Zettlemoyer and CollinsI (120071) . also uses hand c rafted rules. The Austin group has several papers over the 
years. Many of their works including the one by iGe and Mooneyl (i2009) use a word alignment method to 
learn semantic lexicon and l earn rules for composing meaning representation. 

Similar to the work by lOe and Mooneyl (120091), we use an existing syntactic pa rser to parse natural 
language. However we use a CCG parser, as described by Clark and CurranI ( 2007 ). to parse sentences, 
use lambda calculus for meaning representatio n, use the CCG parsing to compose meaning and have an 
initial dictionary. Note that unlike the work by Ge and Mooney (|2009), we do not need to learn rules for 
composing meaning representation. We use a novel method to learn semantic lexicon which is based on two 
inverse lambda operators t hat allow us to compute F given G and H such that F@G = H or G@F = H. 
Compared to the work by IZettlemoyer and CollinsI (l2005h . we use the same learning approach but use a 
completely different approach in lexical generation. Our inverse A operator h as been tested to w ork for many 
languages including first order logic , database qu ery language, CLANG by IChen et all (l2003h . answer set 
programming (ASP) as described by lBarall (|2003l) . and temporal logic. Thus our approach is not dependent 
on the language used to represent the semantics, nor limited by a fixed set of rules. Rather, the new A-calculus 
formulas and their semantic models, corresponding to the semantic or meaning representations, are directly 
obtained from known semantic representations which were provided with the data or learned before. The 
richness of A calculus allows us to rely only on the syntactic parse itself without the need to have separate 
rules for composing the semantics. The provided method yields improved experimental results on existing 
corpora on robot command and control and database queries. 



2 Motivation and Background 

We now illustrate how one can use CCG parsing and A-calculus applications to obtain database query repre- 
sentation of sentences. We then motivate and explain the role of our "inverse A" operator. A syntactic and 
semantic parse tree for the sentence "Give me the largest state." is given in Table [T] 

Give me the largest state. 



S/NP NP/N 
S/NP NP/N 
S/NP WP~ 



N/N 



N 



N 



Give me 

Xx.answer{A, x@A) 
Xx.answer{A, x@A) 
Xx.answer{A, x@A) 



the 

Xx.x 

Xx.x 



largest 

Xx.Xy.largest{y, x@y) 



state. 

Xz.state(z) 



Xy.largest{y, state{y)) 



Xy.largest{y, state{y)) 



answer{A, largest{A, state{A))) 



Table 1 : CCG and A-calculus derivation for "Give me the largest state." 



The upper portion of the figure lists the nodes corresponding to the CCG categories which are used to 
syntactically parse th e sentence. These are assigned to each word and then combined using combinatorial 
rules, as described by ISteedman (l2000l) . to obtain the categories corresponding to parts of the sentence and 
finally the complete sentence itself. For example, the category for "largest", N/N is combined with the 
category of "state.", A^, to obtain the category of "largest state.", which is N. In a similar manner, each 
word is assigned a semantic meaning in the form of a A-calculus formula, as indicated by the lower por- 
tion of the figure. The language used to represent the semantics of words and the sentence is the database 
query language used in the robocup domain. The formulas corresponding to words are combined by ap- 
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plying one to another, as dictated by the syntactic parse tree to obtain the semantic representation of the 
whole sentence. For example, the semantics of "the largest state.", Xy.largest{y, state{y)) is applied to 
the semantics of "Give me", Xx.answer{A, x@A), to obtain the semantics of "Give me the largest state.", 
answer{A, largest{A, state{A))). 

The given example illustrates how to obtain the semantics of the sentence given the semantics of words. 
However, what happens if the semantics of the word "largest" is not given? It might be either missing 
completely, or the current semantics of "largest" in the dictionary might simply not be applicable for the 
sentence "Give me the largest state.". 

Let us assume that the semantic representation of "largest" is not known, while the semantic represen- 
tation of the rest of the sentence is known. We can then obtain the semantic representation of "largest" as 
follows. Given the formula answer{A, largest{A, state{A))) for the whole sentence "Give me the largest 
state." and the formula Xx.answer{A, x@A) for "Give me", we can perform some kind of an inverse ap- 
plication^io obtain the semantics representation of "the largest state", Xy.largest{y, state{y)). Similarly, 
we can then use the known semantics of "the", to obtain the semantic representation of "largest state." as 
Xy.largest{y, state{y)). Finally, using the known semantics of state, Xz.state{z) we can obtain the the 
semantics of "largest" as Xx.Xy.largest{y,x@y). 

It is important to note that using @ we are able to construct relatively complex semantic representations 
that are properly mapped to the required syntax. 

Given a s et of training sent e nces w ith their desired semantic representations, a syntactic parser, such 
as the one by IClark and CurranI (120071) . and an initial dictionary, we can apply the above idea on each of 
the sentences to learn th e missing semantic repre s entati ons of words. We can then apply a learning model, 
such as the one used by IZettlemoyer and Colhnsl (120051) . on these new semantic representations and assign 
weights to different semantic representations. These can then be used to parse and represent the semantics 
of new sentences. This briefly sums up our approach to learn and compute new semantic representations. It 
is easy to see that this approach can be applied with respect to any language that can be handled by "inverse 
A" operators and is not limited in the set of new representations it provides. 

We will consider t wo do mains to evaluate our approach. The fist one is the GEOQUERY domain used 
by IZelle and Mooney (| 19961) . which uses a Prolog based language to query a database with geographical 
information about the U.S. It should be noted that this language uses higher-or der predicates. An example 
query is provided in Table [T] The second domain is the ROBOCUP domain of IChen et all (|2003l ). This is 
a multi-age nt domain where a gents compete against each other in a simulated soccer game. The language 
CLANG of IChen et al.l (|2003l ) is a formal language used to provide instructions to the agents. An example 
query with the corresponding natural language sentence is given below. 



• If the ball is in our midfield, position player 3 at (-5, -23). 

• ((bpos (midfield our)) (do (player our 3) (pos (pt -5 -23)))) 



3 Learning Approach 



We adopt the learning model given by IZettlemoyer and CoUinsI (I2OO5I . l200i liopj) and use it to assign 
weights to the semantic representations of words. Since a word can have multiple possible syntactic and 
semantic representations assigned to it, such as John may be represented as John as well as Xx.x@John, 
we use the probabilistic model to assign weights to these representations^; 



The main differences between our algorithm and the one given bv IZettlemover and CoUinsI (120051) are 
the way in which new semantic representations are obtained. While Zettlemoyer and Collins ( 20051) uses a 



Thus instead of applying G to F to obtain H, G@F = H, we try to find an F such that G@F = H given G and H. 
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predefined table to obtain these, we obtain the new semantic representations by using inverse A operators 
and generalization. 



3.1 Learning model and parsing 

We assume that complete syntactic parses are availabl^ The parsing uses a prob a bilisti c combinatorial 
categorial grammar framework similar to the one given by IZettlemoyer and Colhns ( 2005 ). We 



assume a 



probabilistic categorial grammar (PCCG) based on a log linear model. Let S denote a sentence, L denote 
the semantic representation of the sentence, and T denote it's parse tree. We assume a mapping / of a triple 
(L, T, S) to feature vectors i?*^ and a vector of parameters G i?'^ representing the weights. Then the 
probabihty of a particular syntactic and semantic parse is given as: 



PiL,T\S;e) = ^ 



J{L.T.S).e 



ef(L,T,S).B 
(L.T) ^ 

We use only lexical features. Each feature fj counts the number of times that the lexical entry is used in 

T. 

Parsing a sentence under PCCG includes finding L such that P{L\S; 0) is maximized. 

argmaxLP{L\S]@) = 
argmaxL J2t (-^j T\S; Q) 

We use dynamic programming techniques to calculate the most probable parse for a sentence. 
3.2 The inverse A operators 



For lac k of space, we present only one of the two Inverse A operators. Inverse l and InverseR of lGonzalez 



(l2010h . The objective of these two algorithms is that given typed A-calculus formulas H and G, we want to 



compute the formula F such that F@G = H and G@F = H. First, we introduce the different symbols used 
in the algorithm and their meaning : 

• Let G, H represent typed A-calculus formulas, J^,J^,...,J" represent typed terms, vi to Vn, v and w 
represent variables and cri,...,(T„ represent typed atomic terms. 

• Let /() represent a typed atomic formula. Atomic formulas may have a different arity than the one 
specified and still satisfy the conditions of the algorithm if they contain the necessary typed atomic 
terms. 

• Typed terms that are sub terms of a typed term J are denoted as Jj. 

• If the formulas we are processing within the algorithm do not satisfy any of the if conditions then the 
algorithm returns null. 

Definition 1 (operator :) Consider two lists of typed X-elements A and B, (ai, ...,an) and (bj, ...,bn) re- 
spectively and a formula H. The result of the operation H{A : B) is obtained by replacing ai by hi, for each 
appearance of A in H. 



Next, we present the definition of an inverse operator^ Inverseji{H, G): 



"A sentence can have several different parses. 

'This is the operator that was used in this implementation. In a companion work we develop an enhancement of this operator 
which is proven sound and complete. 
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Definition 2 (Inverseji{H, G)) The junction Inverseji{H, G), is defined as: 
Given G and H: 

1. IfG is Xv.v@J, set F = InverscLiH, J) 

2. If J is a sub term ofH and G is Xv.H{J : v) then F = J. 

3. IfG is not Xv.v@J, J is a sub term of H and G is X'W.H{J{Ji, J^) : w@Jp, @Jq) with 1 < 
p,q,s<m. thenF = Xvi, ...,Vs.J{Ji, Jm ■ Vp,---,Vq). 

The function InverseL{H, G) is defined similarly. 
Illustration: Inverse^ - Case 3: 

Suppose H = in{river, Texas) and G = Xv .v@Texas@river 

G is not of the form Xv.v@J since J = Texas@river is not a formula. Thus the first condition is not 
satisfied. Similarly, there is no J that satisfies the second condition. Thus let us try to find a suitable J 
that satisfies third condition. If we take Ji = river and J2 = Texas, then the third condition is satisfied 
by G = Xx.H{{J{Ji, J2) : x@J2@Ji), which in this case corresponds to G = Xx.H{in{river, Texas) : 
x@Texas@river). Thus, F = Xvi,V2-J{Ji, J2 '■ V2,vi) and so F = Xvi,V2-in{v2, vi). 
It is easy to see that G @ F = H. 

3.3 Generalization 

Using INVERSE_L and INVERSE_R, we are able to obtain new semantic representations of particular 
words in the sentence. However, without any form of generalization, we are not able to extend these to 
words beyond the ones actually contained in the training data. Since our goal is to go beyond that, we strive 
to generaUze the new semantic representations beyond those words. 

To extend our coverage, a function that will take any new learned semantic expressions and the current 
lexicon and will try to use them to obtain new semantic expressions for words of the same category has to be 
designed. It will use the following idea. Consider the non-transitive verb "fly" of category S\NP. Lets as- 
sume we obtain anew semantic expression for "fly" as Xx.fly{x) using INVERSEJL, and INVERSEJi. 
The GENERALIZE function looks up all the words of the same syntactic category, S\NP. It then iden- 
tifies the part of the semantic expression in which "fly" is involved. In our particular case, it's the subex- 
pression fly. It then proceeds to search the dictionary for all the words of category S\NP. For each such 
word w, it will add a new semantic expression Xx.w{x) to the dictionary. For example for the verb "swim", 
it would add Xx.swim{x). 

However, the above idea also comes with a drawback. It can produce a vast amount of new semantics 
representations that are not necessary for most of the sentences, and thus have a negative impact on per- 
formance. Thus instead of applying the above idea on the whole dictionary, we perform generalization "on 
demand". That is, if a sentence contains words with unknown semantics, we look for words of the same 
category and use the same idea to find their semantics. Let us assume I DENT I FY {word, semantics) 
identifies the parts of semantics in which word is involved and REPLACE{s, a, b) replaces a with b in 
s. We assume that each lexical entry is a triple {w, cat, sem) where w is the actual word, cat is the syntactic 
category and sem is the semantic expression corresponding to w and cat. 

GENERALIZEd{L, a) 

• For each Ij G L 

— lflj{cat) = a{cat) 

* I = IDENTIFY {lj{w),lj{sem)) 
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* S = REPLACE{lj{sem),I,a{w)) 

* L = L U {a{w), a{cat), S) 

As an example, consider the sentence "Give me the largest state." from Table [U Let us assume that 
the semantics of the word "largest" as well as "the" is not known, however the semantics of "longest" is 
given by the dictionary as Xx.Xy.longest{y, x@y). Normally, the system would be unable to parse this sen- 
tence and would continue on. However, upon calling GEN ERALIZEj:){L, ''largest"), the word longest 
is found in the dictionary with the same syntactic category. Thus this function takes the semantic representa- 
tion of "longest" Xx.Xy.longest{y, x@y), modifies it accordingly for largest, giving Xx.Xy.largest{y, x@y) 
and stores it in the lexicon. After that, the INVERSEi and INVERSE^ can be appUed to obtain the 
semantics of "the". 

3.4 Trivial inverse solutions 

Even with on demand generalization, we might still be missing large amounts of semantics information to be 
able to use INVERSE^ and INVERSE^. To make up for this, we allow trivial solutions under certain 
conditions. A trivial solution is a solution, where one of the formulas is assigned a Xx.x representation. 
For example, given H, we are looking for F such that H = G@F. If we set G to be Xx.x, then trivially 
F = H. Thus we can try to carefully set some unknown semantics of words as Xx.x which will allow us to 
compute the semantics of the remaining words using INVERSEl and INV ERSEr- The question then 
becomes, when do we allow these? In our approach, we allow these for words that do not seem to have any 
contribution to the final semantic meaning of the text. In some cases, articles such as "the", while having a 
specific place in the English language, might not contribute anything to the actual meaning representation of 
the sentence. In general, any word not present in the final semantics is a potential candidate to be assigned 
the trivial semantic representation Xx.x. These are added with very low weights compared to the semantics 
found using INVERSEl and INV ERSEr, so that if at one point a non-trivial semantic representation is 
found, the system will attempt to use it over the trivial one. 

As an example, consider again the sentence "Give me the largest state." from Tabled] with the semantics 
answer{A, largest{A, state{A))). Let us assume the semantic representations of "the" and "largest" are 
not known. Under normal circumstances the algorithm would be unable to find the semantics of "largest" 
using INVERSE]^ and INVERSE^ as it is missing the semantics of "the". However, as "the" is not 
present in the desired semantics, the system will attempt to assign Ax.x as its semantic representation. After 
doing that, INVERSE^ and INVERSE^ can be used to compute the semantic representation of "largest" 
as Xx.Xy.largest{y,x@y). 

3.5 The overall learning algorithm. 

The complete learning algorithm used within our approach is shown below. The input to the algorithm is an 
initial lexicon Lq and a set of pairs {Si,Li),i = 1, ...,n, where Si is a sentence and Lj its corresponding 
logical form. The output of the algorithm is a PCCG defined by the lexicon Lt and a parameter vector Qt- 

The parameter vector 0j is updated at each iteration of the algorithm. It stores a real number for each 
item in the dictionary. The initial values were set to 0.1. The algorithm is divided into two major steps, 
lexical generation and parameters update. The goal of the algorithm is to extract as much information as 
possible given the provided training data. 

In the first step, the algorithm iterates over all the sentences n times and for each sentence constructs a 
syntactic and (potentially incomplete) semantic parse tree. Using the semantic parse tree, it then attempts 
to obtain new A-calculus formulas by traversing the tree and performing regular applications and inverse 
computations where possible. Any new semantics are then generalized and stored in the lexicon. 
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The main reason to iterate over all the sentences n times is to extract all the possible information given 
the current parameter vector. There may be cases where the information learned from the last sentence can be 
used to learn additional information from the third sentence, which can then be used to learn new semantics 
from the second sentence etc. By looping over all sentences n times, we ensure we capture and learn as 
much information as possible. 

Note that the semantic parse trees of the sentences may change once the parameters of words change. 
Thus even though we are looping over all the sentences T times, the semantic parse tree of a sentence might 
change as a result of a change in the parameter vector. This change can be very minor, such as change in the 
semantics of a single word, or in a rare case a major one where most of the semantic expressions present in 
the tree change. Thus we might learn different semantics of words given different parameter vectors. 

In the second step, the parameter vector is updated using stochastic gradient descent. Steps one and 
two are performed T times. In our experiments, the value of T ranged from 50 to 100. 

Overall, steps one and two form an exhaustive search which optimizes the log-likelihood of the training 
model. 

• Input: 

A set of training sentences with their corresponding desired representations S = {{Si,Li) : i = l...n} 
where Si are sentences and Lj are desired expressions. Weights are given an initial value of 0.1. 

An initial lexicon Lq. An initial feature vector ©q. 

• Output: 

An updated lexicon Lt+i- An updated feature vector ©t+i- 

• Algorithm: 

- For t = 1 . . . T 

- Step 1: (Lexical generation) 

- For i = l...n. 

* For j = l...n. 

* Parse sentence Sj to obtain Tj 

* Traverse Tj 

■ apply INVERSE_L, INVERSEJi and GENERALIZED to find new A-calculus 
expressions of words and phrases a. 

* Set Lt+i = Lt Li a 

- Step 2: (Parameter Estimation) 

- Set et+i = UPDATEiet,Lt+S 

• return GENERALIZE{Lt, Lt), ©(T) 

4 Experimental Evaluation 
4.1 The data 

To evaluate our algorithm, we used the standard corpus in GEOQUERY and CLANG. The GEOQUERY 
corpus contained 880 English sentences with respective database queries. The CLANG corpus contained 

"'For details on & computation, please see the work bv lZettlemover and Collins! j2005t) 
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300 entries specifying rules, conditions and definitions in CLANG. The GEOQUERY corpus contained 
relatively short sentences with the sentences ranging from four to seventeen words of quite similar syntactic 
structure. The sentences in CLANG are much longer, with more complex structure with length ranging from 
five to thirty eight words. 

For our experiments, we used the C&C parser of Clark and CurranI ( 2007 ) to provide syntactic parses for 
sentences. For CLANG corpus, the position vectors and compound nouns with numbers were pre-processed 
and consequently treated as single noun. 

Our experiments were done using a 10 fold cross validation and were conducted as follows. A set of 
training and testing examples was generated from the respective corpus. These were parsed by the C&C 
parser to obtain the syntactic tree structure. These together with the training sets containing the training 
sentences with their corresponding semantic representations (SRs) and an initial dictionary was used to train 
a new dictionary with corresponding parameters. This dictionary was generalized with respect of all the 
words in the test sentences. Note that it is possible that many of the words were still missing their SRs. 
This dictionary was then used to parse the test sentences and highest scoring parse was used to determine 
precision and recall. Since many words might have been missing their SRs, the system might not have 
returned a proper complete semantic parse. 



To measure precision and recall, we adopted the measures given by iGe and Mooneyi (|2009l') . Precision 
denotes the percentage of of returned SRs that were correct, while Recall denotes the percentage of test 
examples with pre-specified SRs returned. F-measure is the standard harmonic mean of precision and recall. 
For database querying, an SR was considered correct if it retrieved the same answer as the standard query. 
For CLANG, an SR was correct if it was an exact match of the desired SR, except for argument ordering of 
conjunctions and other commutative predicates. Additionally, a set of additional experiments was run with 
"(definec" and "(definer" treated as being equal. 

We evaluated two different version of our system. The first one, INV ERSE, uses INV ERSEi and 
INVERSEji and regular generalization which is applied after each step. The second version, INV ERSE+, 
uses trivial inverse solutions as well as on demand generalization. Both systems were evaluated on the same 
data sets using 10 fold cross validation and the C&C parser using an equal number of train and test sen- 
tences, randomly chosen from their respective corpus. The initial dictionary contained a few nouns, with 
the addition of one randomly selected word from the set {what, where, which} in case of GEOQUERY. 
For CLANG, the initial dictionary also contained a few nouns, together with the addition of one randomly 
selected word from the set (i f , when, during}. The learning parameters were set to the values used by 
Zettlemoyer and Colhnsl (jlOOsh . 



4.2 Results 

We compared our systems with the performance results of several alternative systems for \ yhich the perfor- 
mance data is available in the literature. In particular, we used the performance data given bv lGe and MoonevI 
(2009*). The systems that we co mpared with are: The SY NO, SYN20 and GOLDS YN systems by'Ce and Mooney 
(2009), the system SCISSOR by iCe and Mooneyi (l2005l ). an S VM based system KRI PS by Kate and Moone^ 
(120061). a synchronous grarn r nar ba sed system WASP by IWong and MoonevI (120070 . the CCG based system 
by Zettlemoyer and CoUinsI (2007) and the work by Lu et al. ( 2008 ). Please note that many of these ap- 
proaches require different parsers, human supervision or other additional tools, while our approach requires 
a syntactic parse of the sentences and an initial dictionary. 

Our and their reported results for the respective corpora are given in the Tables |2] and [3] 
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Precision 


Recall 


F-measure 




Precision 


Recall 


F-measure 




INVERSE+(i) 


87.67 


79.08 


83.15 


INVERSE+ 


93.41 


89.04 


91.17 




INVERSE+ 


85.74 


76.63 


80.92 


INVERSE 


91.12 


85.78 


88.37 




GOLDSYN 


84.73 


74.00 


79.00 


GOLDSYN 


91.94 


88.18 


90.02 




SYN20 


85.37 


70.00 


76.92 


WASP 


91.95 


86.59 


89.19 




SYNO 


87.01 


67.00 


75.71 


z&c 


91.63 


86.07 


88.76 




WASP 


88.85 


61.93 


72.99 


SCISSOR 


95.50 


77.20 


85.38 




KRISP 


85.20 


61.85 


71.67 


KRISP 


93.34 


71.70 


81.10 




SCISSOR 


89.50 


73.70 


80.80 


Lu at al. 


89.30 


81.50 


85.20 




Lu at al. 


82.50 


67.70 


74.40 



Table 2: Performance on GEOQUERY. Table 3: Performance on CLANG. 

The INVERSE + (i) denotes training where "(definec" and "(definer" at the start of SRs were treated 
as being equal. The main reason for this was that there seems to be no way to distinguish in between them. 
Even as a human, we found it hard to be able to distinguish between them. 



4.3 Analysis 

Our testing showed that our method is capable of outperforming all of the existing parsers in F-measure. 
However, there are parsers which can produce greater precision, such as WASP and SC ISSOR on CLANG 
corpus, however they do at the cost in recall. As discussed by iGe and Mooney (l2009h . the GEOQUERY 



results for SCISSOR, KRISP and Lu's work use a different, less accurate representation language FUNSQL 
which may skew the results. Also, SCISSOR outperforms our system on GEOQUERY corpus in terms of 
precision, but at the cost of additional human supervision. 

Our system is particularly accurate for shorter sentences, or a corpus where many sentences have similar 
general structure, such as GEOQUERY. However, it is also capable of handling longer sentences, in particular 
if they in fact consists of several shorter sentences, such as for example "If the ball is in our midfield, position 
player 3 at (-5,-23).", which can be looked at as "IF A, B" where "A" and "B" are smaller complete sentences 
themselves. The system is capable of learning the semantics of several basic categories such as verbs, after 
which most of the training sentences are easily parsed and missing semantics is learned quickly. The inabiUty 
to parse other sentences mostly comes from two sources. First one is if the test sentence contains a syntactic 
category not seen in the training data. Our generalization model is not capable of generalizing these and thus 
fails to produce a semantic parse. The second problem comes from ambiguity of SRs. During training, many 
words will be assigned several SRs based on the training data. The parses are then ranked and in several 
cases, the correct SR might not be on the top. Re-ranking might help alleviate the second issue. 

Unlike the other systems, we do not make use of a grammar for the semantics of the sentence. The reason 
it is not required is that the actual semantics is analyzed in computing the inverse lambdas, and the richness 
of A-calculus allows us to compute relatively complex formulas to represent the semantic of words. 

We also run examples with increased size of training data. These produced larger dictionaries and in 
general did not significantly affect the results. The main reason is that as discussed before, once the most 
common categories of words have their semantics assigned, most of the sentences can be properly parsed. 
Increasing the amount of training data increases the coverage in terms of the rare syntactic categories, but 
these are also rarely present in the testing data. The used training sample was in all cases sufficient to learn 
almost all of the categories. This might not be the case in general, for example if we had a corpus with all 
of the sentences of a particular length and structure, our method might not be capable of learning any new 
semantics. In such cases, additional words would have to be added to the initial dictionary, or additional 
sentences of varying le ngths would h ave to be added. 



The C&C parser of IClark and Curran (2007 ) was primarily trained on news paper text and thus did have 



some problems with these different domains and in some cases resulted in complex semantic representations 
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of words. This could be improved by using a different parser, or by simply adjusting some of the parse trees. 
In addition, our system can be gradually improved by increasing the size of initial dictionary. 

5 Conclusions and Discussion 

We presented a new approach to map natural language sentences to their semantic representations. We used 
an existing syntactic parser, a novel inverse A operator and several generalization techniques to learn the 
semantic representations of words. Our method is largely independent of the target representation language 
and directly computes the semantic representations based on the syntactic structure of the syntactic parse 
tree and known semantic representations. We used statistical learning methods to assign weights to different 
semantic representation of words and sentences. 

Our results indicate that our approach outperforms many of the existing systems on the standard corpora 
of database querying and robot command and control. 

We envision several directions of future work. One direction is to experiment our system with corpora 
where the natural language semantics is given through other Knowledge Representation languages such as 
answer set programming (ASPfl and temporal logic. We are currently building such corpora. Another 
direction is to improve the statistical learning part of the system. An initial experimentation with a different 
learning algorithm shows significant decrease in training time with slight reduction in performance. Finally, 
since our system uses an initial dictionary, which we tried to minimize by only having a few nouns and one 
of the query words, exploring how to reduce it further and possibly completely eliminating it is a future 
direction of research. 
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